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RAW SEQUENCE LISTING DATE: 09/25/2001 

PATENT APPLICATION: US/09/863,475 TIME: 10:33:32 

j? Input Set : N:\Crf3\RULE60\09863475.txt 

Output Set: N:\CRF3\09252001\I863475.raw 

SEQUENCE LISTING 

4 (1) GENERAL INFORMATION: 

6 (i) APPLICANT: LOWE, . JOHN B. 

8 (ii) TITLE OF INVENTION: METHODS AND PRODUCTS FOR THE SYNTHESIS 

9 OF OLIGOSACCHARIDE STRUCTURES ON GLYCOPROTEINS, 

10 GLYCOLIPIDS, OR AS FREE MOLECULES, AND FOR THE ISOLATION 

11 OF CLONED GENETIC SEQUENCES THAT DETERMINE THESE STRUCTU 
13 (iii) NUMBER OF SEQUENCES: 14 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: OBLON, SPIVAK, MCCLELLAND, MAIER & NEUSTADT, 

17 P.C. 

18 (B) STREET: 1755 Jefferson Davis Highway, Fourth Floor 

19 (C) CITY: Arlington 

20 (D) STATE: Virginia 

21 (E) COUNTRY: U.S.A. 

22 (F) ZIP: 22202 

24 (v) COMPUTER READABLE FORM: 

25 (A) MEDIUM TYPE: Floppy disk 

26 (B) COMPUTER: IBM PC compatible 

27 (C) OPERATING SYSTEM: PC -DOS/MS -DOS 

28 (D) SOFTWARE: Patentln Release #1.0, Version #1.25 
30 (vi) CURRENT APPLICATION DATA: 

C--> 31 (A) APPLICATION NUMBER: US/09/863, 475 

C--> 32 (B) FILING DATE: 24-May-2001 

33 (C) CLASSIFICATION: 

35 (vii) PRIOR APPLICATION DATA: 

36 (A) APPLICATION NUMBER: 07/914,281 

37 (B) FILING DATE: 

41 (Viii) ATTORNEY/AGENT INFORMATION: 

42 (A) NAME: Lavalleye, Jean-Paul M. P. 

43 (B) REGISTRATION NUMBER: 31,451 

44 (C) REFERENCE/DOCKET NUMBER: 2363-060-55 

46 (ix) TELECOMMUNICATION INFORMATION: 

47 (A) TELEPHONE: (703)521-4 500 

48 (B) TELEFAX: (703)486-2347 

49 (C) TELEX: 248855 OPAT UR 
52 (2) INFORMATION FOR SEQ ID NO : 1: 

54 (i) SEQUENCE CHARACTERISTICS: 

55 (A) LENGTH: 2043 base pairs 

56 (B) TYPE: nucleic acid 

57 (C) STRANDEDNESS : unknown 

58 (D) TOPOLOGY: unknown 
60 (ii) MOLECULE TYPE: cDNA 
62 (iv) ANTI- SENSE: NO 

66 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

68 AGGAAACCTG CCATGGCCTC CTGGTGAGCT GTCCTCATCC ACTGCTCGCT GCCTCTCCAG 60 
70 ATACTCTGAC CCATGGATCC CCTGGGTGCA GCCAAGCCAC AATGGCCATG GCGCCGCTGT 120 
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RAW SEQUENCE LISTING DATE: 09/25/2001 

PATENT APPLICATION: US/09/863 , 475 TIME: 10:33:32 

Input Set : N:\Crf3\RULE60\09863475.txt 
Output Set: N:\CRtf3\09252001\I863475.raw 

72 CTGGCCGCAC TGCTATTTCA GCTGCTGGTG GCTGTGTGTT TCTTCTCCTA CCTGCGTGTG 180 

74 TCCCGAGACG ATGCCACTGG ATCCCCTAGG GCTCCCAGTG GGTCCTCCCG ACAGGACACC 24 0 

76 ACTCCCACCC GCCCCACCCT CCTGATCCTG CTATGGACAT GGCCTTTCCA CATCCCTGTG 300 

78 GCTCTGTCCC GCTGTTCAGA GATGGTGCCC GGCACAGCCG ACTGCCACAT CACTGCCGAC 360 

80 CGCAAGGTGT ACCCACAGGC AGACACGGTC ATCGTGCACC ACTGGGATAT CATGTCCAAC 420 

82 CCTAAGTCAC GCCTCCCACC TTCCCCGAGG CCGCAGGGGC AGCGCTGGAT CTGGTTCAAC 480 

84 TTGGAGCCAC CCCCTAACTG CCAGCACCTG GAAGCCCTGG ACAGATACTT CAATCTCACC 540 

86 ATGTCCTACC GCAGCGACTC CGACATCTTC ACGCCCTACG GCTGGCTGGA GCCGTGGTCC 600 

88 GGCCAGCCTG CCCACCCACC GCTCAACCTC TCGGCCAAGA CCGAGCTGGT GGCCTGGGCG 660 

90 GTGTCCAACT GGAAGCCGGA CTCAGCCAGG GTGCGCTACT ACCAGAGCCT GCAGGCTCAT 720 

92 CTCAAGGTGG ACGTGTACGG ACGCTCCCAC AAGCCCCTGC CCAAGGGGAC CATGATGGAG 780 

94 ACGCTGTCCC GGTACAAGTT CTACCTGGCC TTCGAGAACT CCTTGCACCC CGACTACATC 84 0 

96 ACCGAGAAGC TGTGGAGGAA CGCCCTGGAG GCCTGGGCCG TGCCCGTGGT GCTGGGCCCC 900 

98 AGCAGAAGCA ACTACGAGAG GTTCCTGCCA CCCGACGCCT TCATCCACGT GGACGACTTC 960 

100 CAGAGCCCCA AGGACCTGGC CCGGTACCTG CAGGAGCTGG ACAAGGACCA CGCCCGCTAC 1020 

102 CTGAGCTACT TTCGCTGGCG GGAGACGCTG CGGCCTCGCT CCTTCAGCTG GGCACTGGAT 1080 

104 TTCTGCAAGG CCTGCTGGAA ACTGCAGCAG GAATCCAGGT ACCAGACGGT GCGCAGCATA 1140 

106 GCGGCTTGGT TCACCTGAGA GGCCGGCATG GTGCCTGGGC TGCCGGGAAC CTCATCTGCC 1200 

108 TGGGGCCTCA CCTGCTGGAG TCCTTTGTGG CCAACCCTCT CTCTTACCTG GGACCTCACA 1260 

110 CGCTGGGCTT CACGGCTGCC AGGAGCCTCT CCCCTCCAGA AGACTTGCCT GCTAGGGACC 1320 

112 TCGCCTGCTG GGGACCTCGC CTGTTGGGGA CCTCACCTGC TGGGGACCTC ACCTGCTGGG 1380 

114 GACCTTGGCT GCTGGAGGCT GCACCTACTG AGGATGTCGG CGGTCGGGGA CTTTACCTGC 1440 

116 TGGGACCTGC TCCCAGAGAC CTTGCCACAC TGAATCTCAC CTGCTGGGGA CCTCACCCTG 1500 

118 GAGGGCCCTG GGCCCTGGGG AACTGGCTTA CTTGGGGCCC CACCCGGGAG TGATGGTTCT 1560 

120 GGCTGATTTG TTTGTGATGT TGTTAGCCGC CTGTGAGGGG TGCAGAGAGA TCATCACGGC 1620 

122 ACGGTTTCCA GATGTAATAC TGCAAGGAAA AATGATGACG TGTCTCCTCA CTCTAGAGGG 1680 

124 GTTGGTCCCA TGGGTTAAGA GCTCACCCCA GGTTCTCACC TCAGGGGTTA AGAGCTCAGA 1740 

126 GTTCAGACAG GTCCAAGTTC AAGCCCAGGA CCACCACTTA TAGGGTACAG GTGGGATCGA 1800 

128 CTGTAAATGA GGACTTCTGG AACATTCCAA ATATTCTGGG GTTGAGGGAA ATTGCTGCTG 1860 

130 TCTACAAAAT GCCAAGGGTG GACAGGCGCT GTGGCTCACG CCTGTAATTC CAGCACTTTG 1920 

132 GGAGGCTGAG GTAGGAGGAT TGATTGAGGC CAAGAGTTAA AGACCAGCCT GGTCAATATA 1980 

134 GCAAGACCAC GTCTCTAAAT AAAAAATAAT AGGCCGGCCA GGAAAAAAAA AAAAAAAAAA 2040 



136 AAA 2043 
138 (2) INFORMATION FOR SEQ ID NO: 2: 

140 (i) SEQUENCE CHARACTERISTICS: 

141 (A) LENGTH: 361 amino acids 

142 (B) TYPE: amino acid 
14 3 (D) TOPOLOGY: unknown 
145 (ii) MOLECULE TYPE: protein 

149 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

151 Met Asp Pro Leu Gly Ala Ala Lys Pro Gin Trp Pro Trp Arg Arg Cys 

152 15 10 15 

154 Leu Ala Ala Leu Leu Phe Gin Leu Leu Val Ala Val Cys Phe Phe Ser 

155 20 25 30 

157 Tyr Leu Arg Val Ser Arg Asp Asp Ala Thr Gly Ser Pro Arg Ala Pro 

158 35 40 45 

160 Ser Gly Ser Ser Arg Gin Asp Thr Thr Pro Thr Arg Pro Thr Leu Leu 

161 50 55 60 

163 lie Leu Leu Trp Thr Trp Pro Phe His lie Pro Val Ala Leu Ser Arg 
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DATE: 09/25/2001 
SEQUENCE LISTING 475 TIME: 10:33:32 

PATENT APPLICATION. US/u*/ 



raw 

80 



1« » „ t V.! Pro CIV Thr M» ASP 5- 116 ™ 

SI cys ser 01» »« »° _^ _ ^ ^ jw val Hls ^ Irp w 



val IJt o », tb. « a- ™ r» 

169 W^" 1 '.I' ■ ^ £ pro Pro ser Pro u. Pro .» 

170 t OQT . A „ n pro Lys Ser Arg Leu ^ 5 

172 Met ?5 ASD 120 pro pro Asn cys Gin 

173 , \H TrD He Trp Phe Asn Leu Glu Pro p 

Gly Gin Arg Trp 140 Arg 

* 76 r Glu Ala Leu Asp Arg Tyr Phe Asn Leu Thr lM 

178 His Leu Glu i5Q 155 ^ Trp ger 

179 14 5 Tn ~ phe Thr Pro Tyr Gly Trp u 
S ser Asp Ser Asp lie 

182 D n Ala His Pro Pro Leu Asn Leu Ser AX 

184 Gly Gin Pro Ala 5 ^ ^ Arg Val Arg 

185 , »i» Tro Ala Val Ser Asn Trp Lys *r v 2Q5 

187 Val Ala Trp Ala 2Q0 Qly Arg 

188 .f n ser Leu Gin Ala His Leu Lys Val ASP 

190 Tyr Tyr Gin ser j ser Arg 
iQi 21° lvs Gly Thr Met Met Glu inr 

1 91 t,-^ tvq Pro Leu Pro Lys ^->-z ~ 35 

193 Ser Hxs Lys 235 Tyr Ile 

194 225 Phe Glu Asn Ser Leu ax 
f 9 4 6 Tyr Lys Phe Tyr Leu 

197 T c Tpu Trp A rg Asn Ala Leu Glu Ala T P 

199 Thr Glu Lys Leu Trp ^ ^ ^ ^ Asp 

200 _ lv Pro ser Arg Ser Asn Tyr ^xu * 

Val Leu Gly 280 Agp Leu Ala A rg 

203 "I His va i asp Asp Phe Gin Ser 1 

205 Ala Ph ? 116 H 295 . _ *° Leu Ser Tyr Phe 

206 ? PU Gin Glu Leu Asp Lys Asp Hxs Ala Arg Ty 32Q 

208 Tyr Leu Gin Glu 315 ^ Leu Asp 

209 305 Ara Glu Thr Leu Arg Pro Arg Ser Phe 335 
Arg Trp Arg Glu T 330 Arg Tyr Gln Th r 

l\] Phe cys Lys Ala Cys Trp Lys Leu Gin Gin 

2 2 f 7 val Arg Ser Ala Ala Tr P Phe Thr 

(i) SEQUENCE^CHA ^ lr8 

223 pE . nuc ieic acxd 

224 C STRANDEDNESS : unknown 

225 D TO p OLOG Y: unknown 
HI (ii) MOLECULE TYPE: cDNA 
III XV ANTI-SENSE: NO 



S iSE MS SSSS SSSS S£-» — 



60 
120 
180 



9/25A 



;: //C:\Crf3\OutholdvVsrI863475J 



• 



• 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/863,475 



DATE: 09/25/2001 
TIME: 10:33:32 



Input Set : N:\Crf3\RULE60\09863475.txt 
Output Set: N:\CRF3\09252001\I863475.raw 



242 CTGACGATGA GGCTGACTTT 

244 AGGCCTGTAC TACATTTGCC 

246 CATGTCAACA AGATCTCCAT 

248 TCTGGATCAC AGGAGAAAAT 

250 GTCTCAACCG TGGTTGTCGT 

252 AACAGATGGC AGAAGGACTG 

254 CAAGAAGACA ACGTAGAAGG 

256 CCTCAGCTAT GGGACTGGTT 

258 TGGAAGGCGC CGATTGTGTG 

260 GCCACACAGA AACTCACTGT 

262 TACTTAGAAG ACTTTCTGGA 

264 TTTTACGTCA TGATAGACGA 

266 TCCTTACAAG TCTTTGAGAT 



GAACTCAAGA 
TGGACCTAAG 
GTCAAGATCC 
AATGAATGTC 
GTTTTGGGAA 
GTGGTTCCCA 
ACGGAGAGAA 
CAATCCAAAG 
GGAAGGCACT 
GGGGCTGACA 
GTCTGCTGAC 
CACCTCCCGG 
CAGGTCTGAG 



GATCTGCTTA 
ATTTTCATGA 
AAGTCAGAAA 
AAGGGAAAAG 
TATGTCAACA 
AGCTGGTTTA 
AAGGGTAGAA 
AACCGCCCGG 
TATGACACAG 
GTGTTTGCTG 
ATGTACTTCA 
ATGCCTGTCG 
AAGAGGTGGC 



CCCCAGTCTC 
TCACTATGCT 
CAAGTCTTCC 
TAATCCTGTT 
GAATTCCAGA 
AAAATGGGAC 
ATGGAGATCG 
ATGTTTTGAC 
CTCTGCTGGA 
TGGGAAAGTA 
TGGTTGGCCA 
TGCACCTGAA 
AGGATATCAG 



CTGGAATTAA 
TCAAGATCTC 
ATCCTCAAGA 
GATGCTGATT 
GGTTGGTGAG 
CCACAGTTAT 
CATTGAAGAG 
AGTGACCCCG 
AAAGTACTAC 
CATTGAGCAT 
TCGGGTCATA 
CCCTCTACAT 
CATGATGCGC 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



268 


ATGAAGACCA TTGGGGAGCA 


CATCCTGGCC 


CACATCCAGC 


ACGAGGTCGA CTTCCTCTTC 


1020 


270 


TGCATGGACG TGGATCAAGT 


CTTTCAAGAC 


AACTTCGGGG 


TGGAAACTCT GGGCCAGCTG 


1080 


272 


GTAGCACAGC TCCAGGCCTG 


GTGGTACAAG 


GCCAGTCCCG 


AGAAGTTCAC CTATGAGAGG 


1140 


274 


CGGGAACTGT CGGCCGCGTA 


CATTCCATTC 


GGAGAGGGGG 


ATTTTTACTA CCACGCGGCC 


1200 


276 


ATTTTTGGAG GAACGCCTAC 


TCACATTCTC 


AACCTCACCA 


GGGAGTGCTT TAAGGGGATC 


1260 


278 


CTCCAGGACA AGAAACATGA 


CATAGAAGCC 


CAGTGGCATG 


ATGAGAGCCA CCTCAACAAA 


1320 


280 


TACTTCCTTT TCAACAAACC 


CACTAAAATC 


CTATCTCCAG 


AGTATTGCTG GGACTATCAG 


1380 


282 


ATAGGCCTGC CTTCAGATAT 


TAAAAGTGTC 


AAGGTAGCTT 


GGCAGACAAA AGAGTATAAT 


1440 


284 


TTGGTTAGAA ATAATGTCTG 


ACTTCAAATT 


GTGATGGAAA 


CTTGACACTA TTTCTAACCA 


1500 


287 


(2) INFORMATION FOR SEQ 


ID NO: 4 : 




















289 


(i) 


SEQUENCE CHARACTERISTICS: 


















290 




(A) 


LENGTH: 


394 amino acids 
















291 




(B) 


TYPE: amino acid 




















292 




(D) 


TOPOLOGY: i 


unknown 




















294 


(ii) 


MOLECULE TYPE: ] 


protein 




















298 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 


: 4: 














300 


Met 


He 


Thr 


Met 


Leu 


Gin 


Asp 


Leu 


His 


Val 


Asn 


Lys 


He 


Ser 


Met Ser 




301 


1 








5 










10 










15 




303 


Arg 


Ser 


Lys 


Ser 


Glu 


Thr 


Ser 


Leu 


Pro 


Ser 


Ser 


Arg 


Ser 


Gly 


Ser Gin 




304 








20 










25 










30 






306 


Glu 


Lys 


He 


Met 


Asn 


Val 


Lys 


Gly 


Lys 


Val 


He 


Leu 


Leu 


Met 


Leu He 




307 






35 










40 










45 








309 


Val 


Ser 


Thr 


Val 


Val 


Val 


Val 


Phe 


Trp 


Glu 


Tyr 


Val 


Asn 


Arg 


He Pro 




310 




50 










55 










60 










312 


Glu 


Val 


Gly 


Glu 


Asn 


Arg 


Trp 


Gin 


Lys 


Asp 


Trp 


Trp 


Phe 


Pro 


Ser Trp 




313 


65 










70 










75 








80 




315 


Phe 


Lys 


Asn 


Gly 


Thr 


His 


Ser 


Tyr 


Gin 


Glu 


Asp 


Asn 


Val 


Glu 


Gly Arg 




316 










85 










90 










95 




318 


Arg 


Glu 


Lys 


Gly 


Arg 


Asn 


Gly 


Asp 


Arg 


He 


Glu 


Glu 


Pro 


Gin 


Leu Trp 




319 








100 










105 










110 






321 


Asp 


Trp 


Phe 


Asn 


Pro 


Lys 


Asn 


Arg 


Pro 


Asp 


Val 


Leu 


Thr 


Val 


Thr Pro 




322 






115 










120 










125 








324 


Trp 


Lys 


Ala 


Pro 


He 


Val 


Trp 


Glu 


Gly Thr 


Tyr 


Asp 


Thr 


Ala 


Leu Leu 




325 




130 










135 










140 










327 


Glu 


Lys 


Tyr 


Tyr 


Ala 


Thr 


Gin 


Lys 


Leu 


Thr 


Val 


Gly 


Leu 


Thr 


Val Phe 




328 


145 










150 










155 








160 
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343 
345 
346 
348 
349 



DATE: 09/25/2001 
RAW SEQUENCE LISTING 4?5 TIME : 10:33:32 

PATENT APPLICATION: US/09/863 , 47 

330 sll v. G1 , «. «u » ls *r - - - «- - ;s s ;; 

3 165 . , , le p he Tyr Val Met 

3 " ». *SP Met Tyr Phe Met v.l Gly Hi. . »r, val ^ 

Ile isP s ~ - - s - - - - » - 

ser Leu iS V. «- - »| « ~ - -s «J «, - - - 

SK S « «. - - - - - - £ Ile Le ° lla Hls ? 

245 . T riv cin Leu Val Ala Gin Leu 

Gin Asp Asn Phe Gly Val Glu Thr Leu Gly Gin ^ 

S - 2 - - - s 116 "° phe Gly S Gly isP I Z 

i Iyl ;s »u ... «- s - - - - iie - s 

£ «, ex. cy S - £ .xr xx. ~ - - - ~ - xx* 

360 Thr Arg 330 Leu phe 

3 3 S Glu Ala Gin Trp His Asp Glu Ser Hi- Leu Asn Lys Ty 

Asn L ys Pro i£ Lys Tie Leu Ser Pro Glu T yr Cys Trp As P Tyr Gin 
67 355 Tte lvs Ser Val Lys Val Ala Trp Gin Thr 

369 lie Gly Leu Pro ser asp lie Lys Ser ^ 

3 70 370 ™ tpu Val Arg Asn Asn Val 
372 Lys Glu Tyr Asn Leu Val a g 

III m INFORMATION FOR SEQ ID NO: 5: 

3 ? 77 ( } (i) SEQUENCE CHARACTERISTICS : 

377 ( } (A) LENGTH : 8174 base pairs 

37 1 (B) TYPE: nucleic acid 
319 n c STRANDEDNESS : unknown 
380 TOPOLOGY: unknown 
383 (ii) MOLECULE TYPE: DNA (genomic) 

( iv) ANTI-SENSE: NO ^ ^ 5 . mm ^^ TRC 60 



^Sss^TSEs'jssss ssss 
ssss ss sssss ~ ---g ssss 
ssss sssss =i lis sssss = 

- SSSSS SSSS SEE. -g-s ss JESS 

2 iii iiii SEE ssss 5SS- 



120 

180 

240 

300 

360 

420 

480 

540 

600 
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VERIFICATION SUMMARY DATE: 09/25/2001 

PATENT APPLICATION: US/09/863,475 TIME: 10 : 33 : 33 

Input Set : N:\Crf3\RULE60\09863475.txt 
Output Set: N:\CRF3\09252001\I863475.raw 

L:31 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 
L:32 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
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